SoS and Planted Clique: Tight Analysis of MPW Moments at all Degrees and an Optimal Lower Bound at Degree Four
نویسندگان
چکیده
The problem of finding large cliques in random graphs and its “planted" variant, where one wants to recover a clique of size ω log (n) added to an Erdős-Rényi graph G ∼ G(n, 2 ), have been intensely studied. Nevertheless, existing polynomial time algorithms can only recover planted cliques of size ω = Ω( √ n). By contrast, information theoretically, one can recover planted cliques so long as ω log (n). In this work, we continue the investigation of algorithms from the sum of squares hierarchy for solving the planted clique problem begun by Meka, Potechin, and Wigderson [MPW15] and Deshpande and Montanari [DM15]. Our main results improve upon both these previous works by showing: 1. Degree four SoS does not recover the planted clique unless ω √ n/polylog n, improving upon the bound ω n1/3 due to [DM15]. A similar result was obtained independently by Raghavendra and Schramm [RS15]. 2. For 2 < d = o( √ log (n)), degree 2d SoS does not recover the planted clique unless ω n1/(d+1)/(2d polylog n), improving upon the bound due to [MPW15]. Our proof for the second result is based on a fine spectral analysis of the certificate used in the prior works [MPW15, DM15, FK03] by decomposing it along an appropriately chosen basis. Along the way, we develop combinatorial tools to analyze the spectrum of random matrices with dependent entries and to understand the symmetries in the eigenspaces of the set symmetric matrices inspired by work of Grigoriev [Gri01a]. An argument of Kelner shows that the first result cannot be proved using the same certificate. Rather, our proof involves constructing and analyzing a new certificate that yields the nearly tight lower bound by “correcting" the certificate of [MPW15, DM15, FK03]. ∗Department of Computer Science, Cornell University. Work done as an intern at Microsoft Research New England. †Department of Computer Science, UT Austin. Work done as an intern at Microsoft Research New England. ‡Simons Fellow. Work done as an intern at Microsoft Research New England and at MIT with an NSF graduate research fellowship under grant No. 0645960. ar X iv :1 50 7. 05 23 0v 1 [ cs .C C ] 1 8 Ju l 2 01 5
منابع مشابه
Tight Lower Bounds for Planted Clique in the Degree-4 SOS Program
We give a lower bound of Ω̃( √ n) for the degree-4 Sum-of-Squares SDP relaxation for the planted clique problem. Specifically, we show that on an Erdös-Rényi graph G(n, 1 2 ), with high probability there is a feasible point for the degree-4 SOS relaxation of the clique problem with an objective value of Ω̃( √ n), so that the program cannot distinguish between a random graph and a random graph wit...
متن کاملImproved Sum-of-Squares Lower Bounds for Hidden Clique and Hidden Submatrix Problems
Given a large data matrix A ∈ Rn×n, we consider the problem of determining whether its entries are i.i.d. from some known marginal distribution Aij ∼ P0, or instead A contains a principal submatrix AQ,Q whose entries have marginal distribution Aij ∼ P1 6= P0. As a special case, the hidden (or planted) clique problem is finding a planted clique in an otherwise uniformly random graph. Assuming un...
متن کاملSum-of-Squares Lower Bounds for Sparse PCA
This paper establishes a statistical versus computational trade-off for solving a basic high-dimensional machine learning problem via a basic convex relaxation method. Specifically, we consider the Sparse Principal Component Analysis (Sparse PCA) problem, and the family of Sum-of-Squares (SoS, aka Lasserre/Parillo) convex relaxations. It was well known that in large dimension p, a planted k-spa...
متن کاملStatistical Algorithms and a Lower Bound for Detecting Planted Clique
We introduce a framework for proving lower bounds on computational problems over distributions, for a class of algorithms called statistical algorithms. For such algorithms, access to the input distribution is limited to obtaining an estimate of the expectation of any given function on a sample drawn randomly from the input distribution, rather than directly accessing samples. Most natural algo...
متن کاملStatistical Algorithms and a Lower Bound for Planted Clique
We develop a framework for proving lower bounds on computational problems over distributions, including optimization and unsupervised learning. Our framework is based on defining a restricted class of algorithms, called statistical algorithms, that instead of accessing samples from the input distribution can only obtain an estimate of the expectation of any given function on a sample drawn rand...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- CoRR
دوره abs/1507.05230 شماره
صفحات -
تاریخ انتشار 2015